Figure 1 125P5C8 SSH sequence 

GATCACGTGCTGTCGATATCCTTCACATTGCCATGTTCAGTGAGCTGTAGATAATCTCTGGAGCCAGGTGCTGAAGTGA 
TATATCCCAGAAATATCACTTGATTAGAGCTACTTTTCAGTAGTTTTGAAACAGCAATAGCCTGCAGTTTCCTGTCGAG 
GTCATCTTCGTGGTTCCCAAAGTGTGTCACGACAAAATCCACCAGCTTGCCCGAAATGTTAACGGTCAATGTGATGGCT 
GGTGCGATCTTGCTGTGTTGGCCAGGCTGGTCTCAACGTGCAGATAGATC 
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Figure 2 Double stranded sequence and ORF for 125P5C8-Pro-pCR2 . 1 . 
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Figure 2 Cont'd 



1621 


CTG 


GTG 


GAT 


TTT 


GTC 


GTG 


ACA 


CAC 


TTT 


GGG 


AAC 


CAC 


GAA 


GAT 


GAC 


CTC 


GAC 


AGG 




K 


L 


Q 


A 


I 


A 


V 


S 


K 


L 


L 


K 


S 


S 


S 


N 


Q 


V 


1675 


AAA 


CTG 


CAG 


GCT 


ATT 


GCT 


GTT 


TCA 


AAA 


CTA 


CTG 


AAA 


AGT 


AGC 


TCT 


AAT 


CAA 


GTG 




I 


F 


L 


G 


Y 


*I 


T 


S 


A 


P 


G 


S 


R 


D 


Y 


L 


Q 


L 


1729 


ATA 


TTT 


CTG 


GGA 


TAT 


ATC 


ACT 


TCA 


GCA 


CCT 


GGC 


TCC 


AGA 


GAT 


TAT 


CTA 


CAG 


CTC 




T 


E 


H 


G 


N 


V 


K 


D 


I 


D 


S 


T 


D 


H 


D 


R 


W 


C 


1783 


ACT 


GAA 


CAT 


GGC 


AAT 


GTG 


AAG 


GAT 


ATC 


GAC 


AGC 


ACT 


GAT 


CAT 


GAC 


AGA 


TGG 


TGT 




E 


Y 


I 


M 


Y 


R 


G 


L 


I 


R 


L 


G 


Y 


A 


R 


I 


S 


' H 


1837 


GAA 


TAC 


ATT 


ATG 


TAT 


CGA 


GGG 


CTG 


ATC 


AGG 


TTG 


GGT 


TAT 


GCA 


AGA 


ATC 


TCC 


CAT 




A 


E 


L 


S 


D 


S 


E 


I 


Q 


M 


A 


K 


F 


R 


I 


P 


D 


D 


1891 


GCT 


GAA 


CTG 


AGT 


GAT 


TCA 


GAA 


ATT 


CAG 


ATG 


GCA 


AAA 


TTT 


AGG 


ATC 


CCT 


GAT 


GAC 




P 


T 


N 


Y 


R 


D 


N 


Q 


K 


V 


V 


I 


D 


H 


R 


E 


V 


S 


1945 


CCC 


ACT 


AAT 


TAT 


AGA 


GAC 


AAC 


CAG 


AAA 


GTG 


GTC 


ATA 


GAC 


CAC 


AGA 


GAA 


GTT 


TCT 




E 


K 


I 


H 


F 


N 


P 


R 


F 


G 


S 


Y 


K 


E 


G 


H 


N 


Y 


1999 


GAG 


AAA 


ATT 


CAT 


TTT 


AAT 


CCC 


AGA 


TTT 


GGA 


TCC 


TAC 


AAA 


GAA 


GGA 


CAC 


AAT 


TAT 




E 


N 


N 


H 


H 


F 


H 


M 


N 


T 


P 


K 


Y 


F 


L 


* 






2053 


GAA 


AAC 


AAC 


CAT 


CAT 


TTT 


CAT 


ATG 


AAT 


ACT 


CCC 


AAA 


TAC 


TTT 


TTA 


TGA 


AAC 





Figure 3 Amino acid sequence of 125P5C8. 



1 MTSLWREILL ESLLGCVSWS LYHDLGPMIY YFPLQTLELT GLEGFSIAFL 5 0 
51 SPIFLTITPF WKLVNKKWML TLLRIITIGS IASFQAPNAK LRLMVLALGV 100 
101 SSSLIVQAVT WWSGSHLQRY LRIWGFILGQ IVLWLRIWY TSLNPIWSYQ 15 0 
151 MSNKVILTLS AIATLDRIGT DGDCSKPEEK KTGEVATGMA SRPNWLLAGA 2 00 

2 01 AFGSLVFLTH WVFGEVSLVS RWAVSGHPHP GPDPNPFGGA VLLCLASGLM 250 
251 LPSCLWFRGT GL I WWVTGTA SAAGLLYLHT WAAAVSGCVF AIFTASMWPQ 3 00 

3 01 TLGHLINSGT NPGKTMTIAM IFYLLEIFFC AWCTAFKFVP GGVYARERSD 3 50 
351 VLLGTMMLII GLNMLFGPKK NLDLLLQTKN SSKVLFRKSE KYMKLFLWLL 4 00 

4 01 VGVGLLGLGL RHKAYERKLG KVAPTKEVSA AIWPFRFGYD NEGWSSLERS 4 50 
451 AHLLNETGAD FITILESDAS KPYMGNNDLT MWLGEKLGFY TDFGPSTRYH 500 
501 TWGIMALSRY PIVKSEHHLL PSPEGEIAPA ITLTVNISGK LVDFWTHFG 550 
551 NHEDDLDRKL QAIAVSKLLK SSSNQVIFLG YITSAPGSRD YLQLTEHGNV 6 00 
601 KDIDSTDHDR WCEYIMYRGL IRLGYARISH AELSDSEIQM AKFRIPDDPT 650 
651 NYRDNQKWI DHREVSEKIH FNPRFGSYKE GHNYENNHHF HMNTPKYFL 6 99 




Figure 4A Alignment with AK025164 protein product 

Score = 1397 bits (3615), Expect =0.0 

Identities = 682/699 (97%), Positives = 683/699 (97%) 

Query : 1 MTSLWREILLESLLGCVSWSLYHDLGPMIYYFPLQTLELTGLEGFSIAFLSPIFLTITPF 60 

MTSLWREILLESLLGCVSWSLYHDLGPMIYYFPLQTLELTGLEGFSIAFLSPIFLTITPF 
Sbjct : 1 MTSLWREILLESLLGCVSWSLYHDLGPMIYYFPLQTLELTGLEGFSIAFLSPIFLTITPF 60 

Query : 61 WKIjWKKWMLTLLRIITIGSIASFQAPNAKLRLMVLALGVSSSLIVQAVTWWSGSHLQRY 120 

WKLWKKWMLTLLRIITIGSIASFQAPNAKLRLMVLALGVSSSLIVQAVTWWSGSHLQRV 
Sbjct : 61 WKLVNKKWMLTLLRIITIGSIASFQAPNAKLRLMVLALGVSSSLIVQAVTWWSGSHLQRY 120 

Query: 121 LRIWGFILGQIVLWLRIWYTSLNPIWSYQMSNKVILTLSAIATLDRIGTDGDCSKPEEK 180 

LRIWGFILGQIVLWLRIWYTSLNPIWSYQMSNKVILTLSAIATLDRIGTDGDCSKPEEK 
Sbjct: 121 LRIWGFILGQIVLWLRIWYTSLNPIWSYQMSNKVILTLSAIATLDRIGTDGDCSKPEEK 180 

Query: 181 KTGEVATGMASRPNWLLAGAAFGSLVFLTHWVFGEVSLVSRWAVSGHPHPGPDPNPFGGA 24 0 

KTGEVATGMASRPNWLLAGAAFGSLVFLTHWVFGEVSLVSRWAVSGHPHPGPDPNPFGGA 
Sbjct: 181 KTGEVATGMASRPNWLLAGAAFGSLVFLTHWVFGEVSLVSRWAVSGHPHPGPDPNPFGGA 240 

Query: 241 VLLCLASGLMLPSCLWFRGTGLIWWTGTASAAGLLYLHTWAAAVSGCVFAIFTASMWPQ 300 

VLLCLASGLMLPSCLWFRGTGLIWWVTGTASAAGLLYLHTWAAAVSGCVFAIFTASMWPQ 
Sbjct: 241 VLLCLASGLMLPSCLWFRGTGLIWWVTGTASAAGLLYLHTWAAAVSGCVFAIFTASMWPQ 3 00 

Query: 301 TLGHL I NSGTNPGKTMT I AN I F YLLE I F FCAWCTAFKFVPGGVYARERSDVLLGTMML I I 360 

TLGHL INSGTNPGKTMTI AMI F YLLE I FFCAWCTAFKFVPGGVYARERSD VLLGTMML I I 
Sbjct: 301 TLGHL INSGTNPGKTMTI AMI F YLLE I FFCAWCTAFKF VPGGVYARERSDVLLGTMML I I 360 

Query: 3 61 GLNMLFGPKKNLDLLLQTKNSSKVLFRKSEKYMKXXXXXXXXXXXXXXXXRHKAYERKLG 42 0 

GLNMLFGPKKNLDLLLQTKNSSKVLFRKSEKYMK RHKAYERKLG 
Sbjct: 361 GLNMLFGPKKNLDLLLQTKNSSKVLFRKSEKYMKLFLWLLVGVGLLGLGLRHKAYERKLG 420 

Query: 421 KVAPTKE VS AAI WPFRFGYDNEGWS SLERS AHLLNETGADF IT I LESDAS KP YMGNNDLT 4 80 

KVAPTKE VS AAI WPFRFGYDNEGWS SLERS AHLLNETGADF IT I LESDAS KP YMGNNDLT 
Sbjct: 421 KVAPTKEVSAAIWPFRFGYDNEGWSSLERSAHLLNETGADF IT I LESDAS KP YMGNNDLT 480 

Query: 481 MWLGEKLGFYTDFGPSTRYHTWGIMALSRYPIVKSEHHLLPSPEGEIAPAITLTVNISGK 54 0 

MWLGEKLGFYTDFGPSTRYHTWGIMALSRYPIVKSEHHLLPSPEGEIAPAITLTVNISGK 
Sbjct: 481 MWLGEKLGFYTDFGPSTRYHTWGIMALSRYPIVKSEHHLLPSPEGEIAPAITLTVNISGK 540 

Query: 541 LVTJF WTHFGNHEDDLDRKLQAI AVS KLLKS S SNQVI FLGY ITS APGSRD YLQLTEHGNV* 60 0 

LVDF WTHFGNHEDDLDRKLQAI AVS KLLKS S SNQV I FLGY I TS APGSRD YLQLTEHGNV 
Sbjct: 541 LVDF WTHFGNHEDDLDRKLQAI AVS KLLKS S SNQVI FLGY ITS APGSRD YLQLTEHGNV 600 

Query: 601 KDIDSTDHDRWCEYIMYRGLIRLGYARISHAELSDSEIQMAKFRIPDDPTNYRDNQKWI 660 

KDIDSTDHDRWCEYIMYRGLIRLGYARISHAELSDSEIQMAKFRIPDDPTNYRDNQKWI 
Sbjct: 601 KDIDSTDHDRWCEYIMYRGLIRLGYARISHAELSDSEIQMAKFRIPDDPTNYRDNQKWI 660 

Query: 661 DHREVSEKIHFNPRFGSYKEGHNYENNHHFHMNTPKYFL 699 

DHREVSEKIHFNPRFGSYKEGHNYENNH+FHMNTPKYFL 
Sbjct: 661 DHREVSEKIHFNPRFGSYKEGHNYENNHNFHMNTPKYFL 699 



Figure 4B Alignment with yeast YCR017 cp Protein 

Score = 261 .bits (668) , Expect = le-68 

Identities = 204/705 (28%), Positives = 330/705 (45%), Gaps = 50/705 (7%) 

Query : 15 GCVSWSLYHDLGPMIYYFPLQTLELTGLEGFSIAFLSPIFLTITPFWKLVNKKWMLTLLR 74 

G + WS L I++FPL + ++G E + +L PIFL + PF ++ + L 

Sbjct: 279 GFLFWSNVTSLLCS IWHFPLWYMGISGYEAAILGYLGP I FLYL - PFVSEAFTQYGVLLGG 337 

Query: 75 IITIGSIASFQAPNAKLRLMVLALGVSSSLIVQAVTWWSGSHLQ-RYLRIWGFILGQIVL 13 3 

II IG+ QP+L++ +++VQ++++ + W +LG + 

Sbjct: 338 IIAIGAYI-VQMPELRLISVAVGTSITVATFVQNLRYITNAETSFSFALTW- -LLGLVAS 394 

Query: 134 WLRIWYTSLNPIWSYQMS NKVILTLSAIATLDRIGTDGDCSKPEEKKTGEVATG 188 

V+L+^- + + NP W NK L L+ + + + + E K+ ' + + 

Sbj Ct : 3 95 VILKMGFYTNNPTWVILDERNGGYNKTALVLTVLFGM- -LSPYVNSINFEGKRNAQAKS - 451 

Query: 189 MASRPNWLLAGAAFGSLVFLTHWVFGEVSLVSRWAVSGHPHP-GPDPNPFGGAVLLCLAS 247 

AS L FGSL+F H + + S WA G+ GP P P+G L C 

Sbjct: 452 -ASLIGKLFLAVGFGSLLFGIHQLLTDSSTTIYWAWEGYNESHGPLPWPWGA- -LTCTVM 508 

Query: 248 GLMLPSCLWFRGTGLIWWVTGTASAAGLLY- -LHTWAAAV- SGCVFAIFTASMWPQ T 301 

S + FGL++ SAL + W + G ++AI + P 

Sbjct: 509 LFASLSSVKFMGKPLVPCLLLLISTAVLSARSITQWPKYIFGGLLYAIAMLWLVPSYFSA 568 

Query: 302 LGHL INSGTNPGKTMT I AMI F YLLE I FFCAWCTAFKFVPGGVYARERSD VLLGTMML I I G 361 

LG + N ++ Y+ + +' W A+ FVP G RE + + +L I 

Sbjct: 569 LGQVQNIWV YVL S F S VY 1 1 F VL AHVWVVA YAF VPMG W VLRE K I E T VL AF S S T F 1 1 623 

Query: 3 62 LNMLFGPKKNLDLLLQTKNSSKVLFRKSEKYMKXXXXXXXXXXXXXXXXRHKAYERKLGK 421 

+ L N+ L+ K +F R R G 

Sbjct: 624 IGALTCKNLNIQLVTMGKKFF I YVF F FAVALL S LTARF VYD I RPTG I 670 

Query: 422 VAP TKEVSAAIWPFRFGYDNEGWSSLERSAHLLNETGADFITILESDASKPYMGNN 477 

P ++ ++A IW FG DN+ W+S +R +L+ + D + +LE+D + MGN 

Sbjct: 671 PQPYHPDSQLITAGIWTIHFGLDNDMWASEDRMINLIKDMELDWGLLETDTQRITMGNR 730 

Query: 478 DLTMWLGEKLGF YTDFGPSTR YHTWG IMALSRYP I VKS EHHLLPS PEGE I APAI TLTV - N 536 

DLT L L Y DFGP HTWG + LS+ + PIV S HHLLPSP GE+APAI T+ 
Sbjct: 731 DLTSKLAHDLNMYADFGPGPNKHTWGCVLLSKFPIVNSTHHLLPSPVGELAPAIHATLQT 790 

Query: 537 ISGKLVDFWTHFGNHEDDLDRKLQAIAVSKLLKSSSNQVIFLGYITSAPGSRDY-LQLT 595 

+ LVD V H G ED+ DR+LQ+ ++KL+ +++ I L Y+ PG +Y ++ 
Sbjct: 791 YNDTLVDVFVFHSGQEEDEEDRRLQSNYMAKLMGNTTRPAILLSYLWDPGEGNYNTYVS 850 

Query: 596 EHGNVKDIDSTDHDRWCEYIMYRGLIRLGYARISHAELSDSEIQMAKFRIPDDPTNYRDN 655 

E + DID +D DRWCEYI + YRGL R GYAR++ ++D+E+Q+ KF++ + ++ , 

Sbjct: 851 ETSGMHDIDPSDDDRWCEYILYRGLRRTGYARVARGTITDTELQVGKFQVLSEQA-LVEH 909 

Query: 656 QKWIDHREVSEKIHFNPRFGSYKEGHNYENNHHFHM-NTPKYFL 699 

+ ++ +SE + + +F G EH +H+ + P+Y+L 

Sbjct: 910 SDSMYEYGHMSEPE YEDMKFPDKFLGEG - ERGHFYHVFDEPRYYL 953 



